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This article computes the capacity of various idealized soft-decision channels 
modeling an optical channel using an avalanche photodiode detector (APD) and 
pulse-position modulation (PPM). The capacity of this optical channel depends in a 
complicated way on the physical parameters of the APD and the constraints imposed 
by the PPM orthogonal signaling set. This article attempts to identify and separate 
the effects of several fundamental parameters on the capacity of the APD-detected 
optical PPM channel. First, an overall signal-to-noise ratio (SNR) parameter is 
defined such that the capacity as a function of a bit-normalized version of this 
SNR drops precipitously toward zero at quasi-brick-wall limits on bit SNR that 
are numerically the same as the well-understood brick-wall limits for the standard 
additive white Gaussian noise (AWGN) channel. A second parameter is used to 
quantify the effects on capacity of one unique facet of the optical PPM channel 
(as compared with the standard AWGN channel) that causes the noise variance to 
be higher in signal slots than in nonsignal slots. This nonuniform noise variance 
yields interesting capacity effects even when the channel model is AWGN. A third 
parameter is used to measure the effects on capacity of the difference between an 
AWGN model and a non-Gaussian model proposed by Webb (see reference in [2]) 
for approximating the statistics of the APD-detected optical channel. Finally, a 
fourth parameter is used to quantify the blending of a Webb model with a pure 
AWGN model to account for thermal noise. 

Numerical results show that the capacity of M-ary orthogonal signaling on the 
Webb channel exhibits the same brick-wall Shannon limit, (Mln2)/(M — 1), as on 
the AWGN channel (ss —1.59 dB for large M). Results also compare the capacity 
obtained by hard- and soft-output channels and indicate that soft-output channels 
offer a 3-dB advantage. 


I. Introduction 

Our objective in this article is to compute and compare the capacities of various idealized channels that 
might be used to approximate the optical communication system using M- ary pulse-position modulation 
(PPM) and an avalanche photodiode (APD) detector: (1) the standard additive white Gaussian noise 
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channel (here called AWGN-1); (2) a more general AWGN channel (AWGN-2) allowing different variances 
in signal and noise slots; (3) a Webb-distributed channel (Webb-2) modeling the output of an APD in 
the absence of thermal noise and surface leakage current; and (4) a blended Webb and Gaussian channel 
(Webb+ Gaussian) channel modeling Gaussian thermal noise and surface leakage current added to Webb-2 
channel outputs. 

In this article, we analyze in detail the capacities of these idealized channels in terms of certain 
fundamental channel parameters defined in such a way as to accentuate the similarities between the more 
complicated optical channel models and the familiar AWGN-1 channel for which capacity limits are well 
understood. We also compare the capacities achievable with soft- and hard-decision channel outputs. 
The appeal of using soft decisions lies in the ability to take advantage of better performing codes (e.g., 
turbo codes), which admit soft decoding algorithms. 

Our ultimate goal is to develop an understanding of the role of various optical parameters on the 
capacity of an optical communication system. A future article will attempt to utilize the conclusions 
drawn from the idealized models to develop an understanding of the role of various physical optical 
parameters on the capacity of an optical communication system. 


II. Modeling the APD with Webb and Gaussian Approximations 

In an optical communication system using an APD detector, the number of absorbed photons, n, is 
a Poisson random variable with mean n related to the total optical power, P(t), and to the observation 
time, T s , by n = (rj/hv) / Q Ts P(t)dt, where 77 is the detector’s quantum efficiency and hv is the photon 
energy. In response to n absorbed photons, the APD generates an “avalanche” of q electrons with 
a complicated conditional probability distribution derived by McIntyre (see reference in [2]), and the 
probability mass function p(q) is obtained by averaging this conditional probability over the Poisson- 
distributed n. Alternatively, p{q) can be approximated by a simpler continuous probability density derived 
by Webb (see reference in [2]). Furthermore, it is known [2] that the Webb density is well approximated 
by a Gaussian away from its tails, and that the approximation accuracy improves as n gets large. 

We begin by briefly discussing the McIntyre distribution and the various approximations to it that 
form the basis of the idealized channels to be studied in this article. 

A. Mclntyre-Conradi Model of the APD 

The actual number of photons absorbed is a Poisson-distributed random variable, n, with p(n) = 
(n n /n\)e~ n . The conditional probability p(q\n), q > n, n > 0, that the APD generates q electrons in 
response to n absorbed photons was derived by McIntyre and Conradi and can be found in [11]. This 
conditional probability distribution depends on several parameters of the APD detector, including its 
gain, G, and ionization ratio, k e ff. See Appendix A for physical quantities and notation. The Mclntyre- 
Conradi probability density function of q, given an average n absorbed photons, is obtained by averaging 
the conditional probability distribution p(q\n) over the Poisson-distributed n: 

PM(q ) = Vp(g|?r)^-re _n , q > 0 (1) 

' n\ 

n—0 


The full APD output also contains additive Gaussian noise contributed by thermal noise and surface 
leakage current. Thus, the total electron count x = q + q' at the output of the APD has a probability 
density function that is a convolution of the Mclntyre-Conradi density, Eq. (1), for q with a Gaussian 
probability density function for q'\ 
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x — q — m 


PM(q ) 


(2) 
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where </>(x) = (\ I \f 7 2/K)e~ x / 2 denotes the standardized (zero-mean, unit- variance) Gaussian probability 
density, and where [2] the mean m' and variance a' 2 of the convolving Gaussian density depend on the 
surface leakage current, I s , the thermal noise temperature, T , and other physical parameters. Again, see 
Appendix A. 
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B. The Webb Approximation 

The Mclntyre-Conradi distribution pm(q) in Eq. (1) can be approximated by a continuous probability 
density, Pw(q), derived by Webb (see reference in [2]): 


Pw(q) = 


\/2irnG 2 F 


1 + 


(q — Gn)(F — 1) \ _3/2 
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— ( q — Gn ) 2 
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( 4 ) 


where G is the APD gain and F = k e ffG + (2 — 1/G)(1 — /c e //) is an excess noise factor, given in terms 
of the gain and the ionization ratio, k e ff. 

The Webb-distributed electron count, q, is conveniently represented in terms of a standardized, scaled- 
and-translated Webb random variable, w. Defining q = m + wa , where m = Gn and a = y/ nG 2 F, the 
probability density for the standardized Webb random variable w simplifies to 


<j>(w, 6 2 ) 




g— w 2 /[2(1+w/S)]^ 


w > —6 


( 5 ) 


where S 2 = nF/(F — l) 2 . The standardized Webb probability density <f>(w,S 2 ) is plotted in Fig. 1 
for various values of the parameter S 2 , which determines the skewness of the density function around 
the origin. Note that this standardized Webb probability reduces exactly to a standardized Gaussian 
probability density when the parameter S 2 — * oo. In other words, the standardized Gaussian density <f>(x) 
can be regarded as the limiting case oo) of the family of standardized Webb densities (j)(w, 5 2 ). 

If w is a standardized Webb random variable with skewness parameter 6 2 and probability density 
given by Eq. (5), then q = m + wa is a Webb random variable with mean m, variance a 2 , and skewness 
8 2 , and is denoted as W(m, a 2 , 5 2 ). The standardized Webb random variable w has zero mean and unit 
variance and can be denoted as W( 0, 1, <5 2 ); higher moments of the standardized Webb probability density 
can be computed as polynomials in the parameter 1/S 2 , as shown in Appendix B. The Webb random 
variable q in Eq. (4), approximating the APD output without thermal noise and surface leakage current, 
is W(m, a 2 , S 2 ) with m = Gn, a 2 = nG 2 F, and S 2 = nF/{F — l) 2 . 
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Fig. 1. Family of standardized Webb probability density functions 
[see Eq. (5)]. 


C. The Webb+Gaussian Approximation 

With the Webb approximation to the Mclntyre-Conradi random variable q , the full output x = q + q' 
of the APD is then modeled as a sum of an electrical current due to the Webb-distributed electron count 
q, plus an independent Gaussian count q' due to thermal and surface leakage currents. It is convenient 
to define a standardized, zero-mean, unit-variance Webb+Gaussian random variable, WG(0, 1, <5 2 , 0) = 
\J0 W(0, l,(5 2 ) + v / l — 0 N(0, 1), that represents a blending of a pure Webb random variable, W(0, 1, A 2 ), 
with a pure Gaussian random variable, iV(0, 1). The blending parameter /3 measures the proportion 
of the total variance contributed by the Webb component. The standardized Webb+Gaussian random 
variable WG(0, 1, <5 2 , 0) has a probability density fi(x\ S 2 , /3) obtained as a convolution of scaled Webb 
and Gaussian densities: 


(/){x;6 2 ,P) 




1 

y/T=0 


x — w \ 


dw 


( 6 ) 


The standardized Webb+Gaussian probability density <j)(x: < 5 2 ; 0) is plotted in Fig. 2 for S 2 = 1 and 
various values of the blending parameter 0. We note that the pure Webb and pure Gaussian densities 
can be obtained as special cases of the Webb+Gaussian density: S 2 ) = S 2 , 1), </>(: x) = <j)(x; S 2 , 0) 

for any <5 2 , and <p{ x) = 0(x; oo, (3) for any /?. 

The full output x = q + q' of the APD is written in terms of the standardized Webb+Gaussian random 
variable as x = m + m' + \J cr 2 + a’ 2 WG(0, 1, <5 2 , 0), where 0 = a 2 1 (a 2 + a' 2 ), and m' and a' 2 are given 
by Eq. (3), as before. The probability density p(x ) for the full APD output is then 


p(x) 


1 

(J 2 + cr ' 2 


x — m — m! 
\/ a 2 + cr' 2 
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Fig. 2. Family of standardized Webb+Gaussian probability density 
functions [see Eq. (6)]. 


D. The AWGN Approximation 

When the skewness parameter 5 2 is large, the Webb random variable q in Eq. (4) can be approximated 
by a Gaussian random variable N(m,a 2 ) with the same mean, m = Gn , and variance, a 2 = nG 2 F. The 
corresponding Gaussian probability density for q is 

(7 » 

Furthermore, the full APD output x = q + q' is also Gaussian, N(m + m' ,a 2 + a' 2 ), and Eq. (2) simplifies 
to a single Gaussian probability density function: 


P(x ) 
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\/< 7 2 + a' 2 
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where [2] 
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E. The APD Models with PPM Signaling 

For PPM signaling, the mean number of photons absorbed in a given PPM slot depends on whether 
the signal is present or absent in that slot. The average number of photons, n , absorbed by an APD 
during a PPM slot of duration T s is given by 
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( 9 ) 


- _ V_ f Ta pu\^ t _ f «i ~ r](n b + n s ) signal present 
hv J Q y \ ho « r/hb signal absent 

where h& and h. s , are, respectively, the average number of background photons incident on the detector 
per PPM slot and the average number of signal photons incident on the detector per signal pulse. Here 
the approximations in relating hi and ho to the physical parameters ignore small contributions from 
bulk leakage current and incomplete modulation extinction. See Appendix A for physical quantities and 
notation. A more complete analysis of the dependence of the optical capacity on the physical parameters 
will be given in a future article. In the current article, the general capacity expressions are derived without 
explicitly relating hi and ho to the physical quantities, except for relying on the fact that hi > ho- The 
approximations in Eq. (9) are only used later to relate our numerical evaluations of the general capacity 
expressions to realistic physical parameter values. 

The Webb model for PPM signaling (here called Webb-2) uses the density in Eq. (4) with two different 
values of h: once with the average number hi of photons in the signal slot and a second time with the 
average number h 0 of photons in the M — 1 non-signal slots. Similarly, the AWGN approximation for 
PPM signaling (here called AWGN-2) uses the density in Eq. (7) or Eq. (8) twice, with h = hi in the 
signal slot and h = ho in the M — 1 nonsignal slots. 

Since the mean to and variance cr 2 of the Webb-distributed electron count q are both proportional to 
h, and hi > ho, it follows that the mean to i and the variance cr 2 of q in the signal slots are no smaller 
than the corresponding mean too and variance cr§ in the nonsignal slots. The additional mean m' and 
the additional variance u' contributed by thermal noise and surface leakage current is the same for both 
signal and nonsignal slots. Just as a single Webb density with variance cr 2 approaches a Gaussian density 
with the same variance when S 2 — > oo, the two different Webb densities characterizing the signal and 
nonsignal slots can each be approximated by Gaussian densities if their individual skewness parameters Sf 
and 5q become large. Thus, a Gaussian approximation for the optical PPM channel leads to a somewhat 
more general Gaussian channel than the familiar additive white Gaussian noise (AWGN) channel. In this 
article, we examine two separate versions of the Gaussian problem, the familiar Gaussian model with a 
common variance in all of the slots (here called AWGN-1) and also a “double Gaussian” model (AWGN-2) 
with greater variance in the signal slot than the common variance in the nonsignal slots. 

F. Fundamental Parameter Definitions for PPM Capacity Calculations 

In our PPM capacity derivations in the next section for the various idealized channel models, we 
find that the following four fundamental parameters give useful insights into the capacity limits derived 
for these models. A signal-to-noise ratio (SNR) parameter, p 0 = (mi — too) 2 /(<7o + c' 2 ), measuring 
the ratio of the energy in the difference of the mean counts in the signal and nonsignal slots to the 
variance in the nonsignal slots, plays a role similar to that of the channel symbol SNR, E s /Nq, for the 
standard AWGN-1 channel. Next, a second SNR parameter, p + = (mi — too) 2 /(<t 2 — CTq), measured with 
reference to the excess noise variance in the signal slots above that in the nonsignal slots, determines 
how closely the AWGN-2 model resembles the classic AWGN-1 model. A third parameter, A = <5 2 — <5 q, 
measures the degree to which the Webb-2 model differs from the AWGN-2 model. A fourth parameter, 
Po = 0o /(oo + c - ' 2 ), measures the blending of the Webb and Gaussian components in the nonsignal slots. 
Finally, a derived parameter, p b , obtained from po and the computed capacity, plays a role similar to that 
of the bit SNR, E b /No, for the standard AWGN-1 channel. 

The usefulness of these parameter definitions is that they tend to isolate three types of different 
effects on capacity due to using the more complicated Webb-2 model versus the familiar AWGN-1 model: 

(1) the normal monotonic variation of capacity with the signal-to-noise ratio supplied to the channel; 

(2) the sometimes strange effects on capacity that have nothing to do with the Webb density but are 
solely due to the unequal variances in the signal and nonsignal slots; and (3) effects on capacity when 
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the Webb probability density is not adequately approximated by a Gaussian, or when the total density 
blends Webb and Gaussian components. 

By plugging in h for the signal and nonsignal slots from Eq. (9), we find that the three parameters 
for a pure Webb-2 channel, modeling the APD output without thermal noise and surface leakage current, 
are given in terms of the physical optical system parameters by 


(mi - m 0 ) 2 _ (ni - n 0 ) 2 _ 7?n 2 
n 0 F n b F 


n - A 


p+ = 


(m i — mo) 2 ni — no ijn s 


2 2 
erf - Og 


F 


F 


A = 5 2 ^ Sq 


( n r - n 0 )F r]n s F 


(F- l) 2 (F — l) 5 


(10) 


We note from these equations that, for the optical channel, the parameter A, measuring the deviation of 
a Webb density from a Gaussian approximation, and the parameter p + , measuring the effects of unequal 
variances, are different by a factor depending only on the excess noise factor F, i.e. , A = p+F 2 /(F — l) 2 . 
Furthermore, the fundamental SNR parameter po is approximately equal to p+ times a factor n s /rib that 
is often routinely considered to be the proper SNR measure for the optical channel. Because the variance 
er 2 and the skewness parameter S 2 on the optical APD channel are each linearly proportional to the 
number of absorbed photons, this imposes an additional constraint that 


= $1 = no = P+ 
a\ Sf ni po + p+ 


( 11 ) 


Finally, this allows us to express the Webb-2 skewness parameters Sq and S 2 in terms of the fundamental 
capacity parameters p o, p+, and A as 


Po 


§ 2 = P0±P+ a 

Po 


(12) 


For the Webb+Gaussian channel, modeling the full APD output, including the effects of thermal noise 
and surface leakage current, we find that the fundamental parameters and A are exactly as given 
above, while the definition of the primary SNR parameter po takes into account the additional common 
variance <j' 2 contributed by the Gaussian thermal noise and surface leakage current, and the blending 
parameter /3 q measures the proportion of the total variance contributed by the Webb component: 


Po = 


(?ni - mo) 2 /3o(«i - «o) 2 


erz + o'- 


n 0 F 


Po — 


cr n + a 
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(13) 
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The parameter relationships noted in Eq. (11) for the pure Webb-2 channel remain valid for the 
Webb+Gaussian channel if po is replaced by po/ Po- In this case, the analog to Eq. (11) is 


^o__So__Po_no_ PoP+ 
a? ~ 6f ~ p 1 “ ni po + PoP+ 


and we can express Sq, Sf, and Pi in terms of the other parameters as 


X 2 _ PoP+ 
°o ~ 

Po 


A 


^2 _ PO + PoP+ 
1 Po 


Pi = Po + — 
p+ 


A 


(14) 


(15) 


When the AWGN-2 model is used as in Eq. (7) to model the pure Webb-2 channel, then the parameters 
for this channel are po and p + , defined by Eq. (10). When the AWGN-2 model is used as in Eq. (8) to 
model the full APD output, the parameters are po , given by Eq. (13), and p + , given by Eq. (10). 

Notice that the gain, G, of the APD does not enter into the definitions of the parameters p+ or A 
or the primary SNR parameter p 0 for the pure Webb-2 model. The APD gain G only enters into the 
relative weighting of the (Tq and the cr' 2 variances (i.e., the parameter /3 0 ) in the definition of po for the 
Webb+Gaussian model (or for the AWGN-2 model that approximates the Webb+Gaussian model). 


III. Capacity of M-ary PPM on Channels with Soft Outputs 

For each channel model, we consider the communication system shown in Fig. 3. The output 
U = (Ui, U2, ■ ■ ■ , Up) of a fc-bit source is modulated with (M = 2 fe )-ary PPM to yield a signal 
X = (Xi, X2, ■ ■ ■ , Xm)- A PPM modulator is equivalent to an encoder producing the M = 2 k code- 
words of a (2 fe ,fc) orthogonal code. Thus, the capacity of M-PPM is the same as that of an orthogonal 
signal set with M codewords, and of M- ary frequency shift keying (M-FSK). 

Since X is an invertible function of U, the observation of Y provides an average /( Y;X) bits of 
information about the input U, where /( Y ; X) is the mutual information between Y and X. The capacity 
of PPM modulation on the channel is the maximum amount of information that can be transmitted 
reliably and is given by C = max p ( X ) /(Y ; X). Because of the symmetry of orthogonal signals and of the 
channels considered, capacity is achieved with an equiprobable M-ary source distribution p(X = x) = 
1/M, Vx e {xi,x 2) - • • ,x M }- 


u 

M - ary 

X 

CHANNEL 

Y 


PPM MODULATOR 




Fig. 3. Model for PPM capacity calculations. 


8 


The channel capacity with input signals restricted to an M - ary orthogonal constellation C , and no 
restriction on the channel output, is given by 



p( yl x ) 


^E iec p(yl*) 


dy 


(16) 


where y = (j/i, • • • ,j/m) is the received vector. Because of the symmetry of orthogonal codes with respect 
to the inputs {xi, • • • , xm}, this expression reduces to 


C = J p{ y|x!)log 2 j -j 


p(y|xi) 


dy 


M ExecP(yl*) 


= log 2 M — .E y | Xl log 2 


E^iP(yl x j) ~ 

p(y|xi) 


(17) 


= log 2 M - T; v | Xl log 2 


Ej= 1 p( v l x j) ' 

p(v|x!) 


(18) 


where v is a random vector obtained from y via an arbitrary invertible transformation. 

The capacity formula in Eq. (17) can be computed in principle, as long as p(y |x 7 ) is known for all x ; . 
However, uninspired computation of the expectation of the right side of Eq. (17) requires evaluation of 
an M-dimensional integral. Fortunately, this M -dimensional expectation can be accurately estimated via 
Monte Carlo simulation for all of the channels considered. The method is simply to generate pseudoran- 
dom M-vectors y according to the probability density, p(y|xi), then to evaluate the logarithmic function 
inside the expectation for each sample y thus generated, and finally to average the computed logarithm 
over many pseudorandom samples. This Monte Carlo method was the most practical method to calculate 
numerical capacities for all of the channels considered below, whenever M was larger than about 4. 

The subsections below give the explicit forms of the channel transition probability function p(y |x ? ) for 
each of the channels considered, along with some channel-specific simplifications in the capacity formulas. 

A. Capacity of M- ary PPM on the Standard AWGN Channel (AWGN-1) 

In this case, the possible signals X are of the form x, = (xj i, • • • , xjm) = (0, 0, 0, • • • , 0, m, 0, • • • , 0), 
where the nonzero signal value m is in position j. The transmitted vector x is corrupted by additive 
white Gaussian noise with zero mean and variance cr 2 in each component. Thus, given a transmitted 
signal x = Xj, the components of the received vector y are conditionally independent Gaussian random 
variables, identically distributed except for yy. 


yj is N(m,a 2 ) (signal present) 

yi is N(0,a 2 ), i^j (signal absent) 

This is the model for any set of Af-ary orthogonal signals, with energy per Af-dimensional symbol 
E s = m 2 , transmitted on an AWGN channel with two-sided noise spectral density N 0 / 2 = a 2 . Note that, 
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in the case of PPM signaling, E s refers to the energy per PPM symbol, not per PPM slot. A symbol 
signal-to-noise ratio can be defined by p = m 2 / a 2 = 2E S /N 0 . 2 

For the distributions in Eq. (19), we have 


p( yl x j) 



M/2 

exp 


(yj - m) 2 


M 

J| ex P 

i= 1 



Defining v :j = yj /u, we obtain 


P( v l x j) 

p(v|xi) 


= exp 



(Vj - s/p) 2 ~v 1 + (ui - s/pf 


(20) 


(21) 


The random variables {vj} are conditionally independent, given {xi}, and distributed as 

vi is N(y/p, 1) 

vj is N(0, 1), j/1 

Using Eq. (21) in Eq. (18), we get an expression for the capacity of orthogonal signaling on the AWGN-1 
channel: 



M 

C(p) = log 2 M - E v | X1 log 2 ^2 exp Wp{vj - ui)] (23) 

3 = 1 

For M-PPM on the AWGN-1 channel, both the capacity in Eq. (23) and the uncoded probability of 
symbol error (see Section V) are functions of the single parameter p = m 2 /a 2 , not of m and a separately. 
This is a statement of the well-known fact that the AWGN-1 channel is fully characterized by its SNR 
and does not depend on an overall scaling of the signal and noise. 

For large M, the law of large numbers allows us to approximately replace the last M — 1 terms 
in Eq. (23) by M — 1 times their mean (see derivation in Appendix C). Then the AWGN capacity can be 
approximated as 3 


C{p) « log 2 M - E V1 | X1 log 2 


1 + (M — l)e p/2 e“^ Ul 


(24) 


where Vi is N(y/p, 1) given xi. 

B. Capacity of M-ary PPM on a More General Gaussian Channel (AWGN-2) 

Now we extend the analysis to cover a “double Gaussian” problem (here called AWGN-2) related 
more directly to the PPM optical model and characterized by different means and variances depending 
on whether the signal is present or absent. 


2 Note that p defined in this manner is 3 dB higher than the usual symbol SNR defined as E s /No. 

3 A similar result for noncoherent Af-FSK was derived in [1], 
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Given a transmitted signal x = x ; , the components of the received vector y are conditionally inde- 
pendent Gaussian random variables, identically distributed except for yj-. 


yj is N{m\,a 2 ) (signal present) 

yi is iV(mo, 0 q), 15 fa (signal absent) 

where mi > mo and <j\ > Ug. By symmetry of the orthogonal PPM signal constellation, capacity can be 
evaluated by Eq. (17). For the distribution in Eq. (25), we have 



p(yfa) 


1 \ (M-l)/2 , 1 X 1/2 

2^J (fafaj 6XP 


fa ~ wi ) 2 

2a\ 


M 

n«p 

i= 1 


fa ~ wp ) 2 


(26) 


Thus, 


p(yfa) 

p(y|xi) 


= exp 


v j ~Uj ~ v i+ u\ 
2 


(27) 


where u,- = (yj — mo)/cro and Uj = (yj — rni)/cri. 

We define a symbol SNR parameter, p 0 = (mi — mo) 2 /(XQ, analogously to the SNR parameter p for 
the AWGN-1 channel, where the reference noise level in the AWGN-2 case is the variance <Tq in the noise 
slots. We also define a second SNR parameter, p + = (mi — mo) 2 /{a\ — ctq), where the reference noise 
level is the excess noise variance in the signal slot compared with that in the noise slots (guaranteed to be 
nonnegative since a\ > u 2 ). The AWGN-2 problem reduces to the AWGN-1 problem in the limit as this 
second SNR parameter p + — > 00 . With these definitions, the fa} are conditionally independent given 
xi, and distributed as 


at ( / — Po + P+ 
v\ is N [ faH, 

V P+ 

Vj is 1V(0,1), jfa 1 

In terms of the fa}, the {uj} are determined by the invertible transformation: 


(28) 


P+ 

Po + P+ 


fa - VPo) 


(29) 


Thus, we have 


Uj ± Vj = Vj 


P+ 


Po + P+ 



PoP+ 
Po + P+ 


(30) 


Noting that u 2 — v 2 = (uj + Vj)(y,j — Vj), we use the expressions for Uj ± Vj to calculate the exponent in 
Eq. (27), and obtain the capacity from Eq. (18) in the form 
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(31) 


M 


C(po,p+) = log 2 M - E v | X1 log 2 Y ex P 


3 = 1 


P+ 


P0 + P+ 


y/Po{Vj - «l) + 


Po 
P0 + P+ 


2 2 
^ - vf 


This equation reduces to the standard AWGN-1 capacity for orthogonal signals, Eq. (23), in the limit as 
p+ — > oo for a given po- 

For M-PPM on an AWGN-2 channel, both the capacity in Eq. (31) and the uncoded proba- 
bility of symbol error (see Section V) are functions of the parameters po = ( m i — m O)/ a o and 
p + = (mi — too) 2 /(cti — <Tq). This is a statement that the AWGN-2 channel is fully characterized 
by just two parameters, an overall SNR parameter, p 0 , and an additional SNR parameter, p + , measured 
relative to the excess variance in the signal slots versus that in the nonsignal slots. The four individual 
parameters mo, mi, <Tq, and a\ defining the AWGN-2 channel do not affect the capacity independently. 
The reduction from four to two parameters in this case implies that the capacity is not affected by either 
an overall scale factor or a common displacement of the means. 

In the limit as p + — > oo for a given po, the problem reduces to the familiar AWGN-1 channel. At 
the other extreme, in the limit as p_|_/p 0 — > 0, (i.e., the average background count, hf,, is insignificant 
compared with the average signal count, n s ), the conclusions resemble those derived for a pure Poisson 
quantum-limited model, and there is no fundamental lower limit on the ratio of signal energy to the 
variance contributed by the signal alone. 

For large M, the AWGN-2 capacity can be approximated as (see derivation in Appendix C) 


C(po, P+) = log 2 M - E Vl | X1 log 2 


n /P0 + P4- ( PoP+ - ^P+y/~PoVi ~ Povf 

nl P+ V 2 (po + p + ) 


where Vi is N (^pH, [p 0 + p+]/p+) given x : . 

C. Capacity of M-ary PPM on Webb-Distributed Channels (Webb-1 and Webb-2) 

The Webb-1 channel model simply substitutes Webb random variables W (•,•,•) for the Gaussian 
random variables N(-, •) in Eq. (19) for the AWGN-1 channel model. Given a transmitted signal x = Xj, 
the components of the received vector y are conditionally independent Webb random variables, identically 
distributed except for yj: 


yj is W(m, a 2 , S 2 ) (signal present) 

■yt is W(0, er 2 , <5 2 ), i^j (signal absent) 

That is, the conditional probability density functions are 


P{vMi) 


1 

a 



p(yi\xj) = ^ 


where </>(•; •) is given in Eq. (5). Thus, for the Webb-1 model, 


(32) 
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p(yl x j) 

p(yl x i) 


a~ M $ 


V i-m 2 


1V1 

im*=- 


4>{vj - \fP'i S 2 )4>{vi\8 2 ) 
4>{v\ - y/p; S 2 )^-, 5 2 ) 


where, as in the AWGN-1 channel, v 3 = tjj /er and p = m 2 /a 2 . These variables {vj} are conditionally 
independent, given {xi}, and distributed as 


vi is W (y/p, 1, S 2 ) 

Vj is W(0,l,5 2 ), j/1 


The capacity of the Webb-1 channel is given by plugging into Eq. (18): 

(j> (vj — J~p\ 5 2 ) ;S 2 ) 

C(p, S) = log 2 M - E V , X1 log 2 Y. ) V ' A ,) J (35) 

<P{ V 1 ~ ) <t>{ V 3> S ' ) 

The Webb-2 channel model substitutes Webb random variables for the Gaussian random 

variables in Eq. (25) for the AWGN-2 channel model. Given a transmitted signal x = x ; , the 

components of the received vector y are conditionally independent Webb random variables, identically 
distributed except for yj: 


y 3 is W(mi,a 2 ,S 2 ) (signal present) 

yi is W(?tio, erg, (5g), i^j (signal absent) 


The conditional probability density functions are 


piyMi) = — <t> (— — — ; si 


0 1 V cr 1 


p(yi |xj) = — <t> (— — — ; s%) ,i ^ j 


a o \ &o 


where <(>(•; •) is given by Eq. (5). Following the same method as in Eq. (33), we have 

p(yl x j) = <l>(uj;Sl)<l>(vi]6%) 

P(y| x i) <t>{ui-,Sf)<l>(vj-,6%) 

where, as in the AWGN-2 channel, Vj = ( yj — TOo)/ao and Uj = ( yj — Using Eq. (29), this 

becomes 


p(yl x j) 

p(yl x i) 


Po + p+ 


Po + p+ 


{Vj - Vpo) ;Sl)(t> (t)i 


(vi - v^o) ;^1 ) 
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where again po = (mi — mo) 2 /tr§ and p+ = (mi — mo) 2 /(erf — ctq). Defining A = Sf — S§ and using 
Eq. (12) to eliminate Sf and S/, we obtain the capacity of the Webb-2 channel by plugging into Eq. (18): 


M 


C(po, P+A) = log 2 M - E v | X1 log 2 ^2 

i=i 


P+ ( , — \ do + P+ A \ , ( P+ A 

; fa - y/pS ) ; A <f>[v i; — A 

do + d+ Po J \ Po 


P+ (vi-^pS)-,^^ A) cj) ( Vj\ —A 


Po + P+ 


Po 


Po 


(39) 


In these expectations, the {vj} are conditionally independent, given {xi}, and distributed as 


w : , — do + P+ do + d+ A 

ui is W I y/po, , A 


d+ 


do 


^ is W[ 0,lAA|, j/1 
' do 


(40) 


D. Capacity of M-ary PPM on a Webb+Gaussian Channel 

The Webb+Gaussian channel model substitutes Webb+Gaussian random variables WG(-, •,•,•) for the 
Webb random variables W(-,-, -) in Eq. (36) for the Webb-2 channel model. Given a transmitted signal 
x = Xj , the components of the received vector y are conditionally independent Webb+Gaussian random 
variables, identically distributed except for yy. 


yj is WG(mi, erf, Sf, (3\) (signal present) 

yi is WG(mo, cd, dp, Po), i ± j (signal absent) 

The conditional probability density functions are 

P(Vj\xj) = — (— — — 5 a ) 

P(Vi\^j) = — ( —',$o’Po), 

&o \ co J 

where </>(•; •, •) is given by Eq. (6). Following the same method as in Eq. (37), we have 



d(y|xj) = <l>(uj;Sf,/3 i)<l>(vi',5o,Po) 
P(y|xi) <f>(ui',8hPi)<l>(v j -,So’0o) 


where, as for the AWGN-2 channel, Vj = (yj — mo)/vo and Uj = (Uj — m\)/a\. Using Eq. (29), this 
becomes 


d(yl x i) 

p(y|xi) 


</> (^\J po P + p+ ( Vj - 0(m;<5o,/3o) 

(^\J po P + p+ ( Vl ~ Vp-oySlPr) </>(vj;5%,Po) 


(43) 
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where pg = (mi — mo) 2 /(c>q + cr' 2 ) and p + = (mi — mo) 2 /(cr 2 — Oq). Again defining A = — Sq, and 

using Eq. (15) to eliminate <5 2 , Sg, and Pi, we obtain the capacity of the Webb+Gaussian channel by 
plugging into Eq. (18): 


C(po , P+, A, Po) — 


M 


log 2 M - E v | X1 log 2 ^2 
i = i 


P+ / , 1 Po + P+ AO . Po\ A f P+ A n 

: (Vj - s/po)\ A,Po-\ p[vi\ — A, Po 

Po + P+ Po P+ / V Po 


-^—(vi-Jp~o)-,^ ±J!± \Po + — ) <t> —A,Po 
po + p+ po p+y V Po 


(44) 


In these expectations, the { Vj } are conditionally independent, given {xi}, and distributed as 


rrr<a/ , PO + P+ PO + P+ » n PO N 

Vi is WG(^/pf, , A, Po H ) 

P+ Po P+ 

Vj is IFG(0, 1, — A,p 0 ), i/1 
Po 


(45) 


IV. Bit-SNR Capacity Limits of M-ary PPM on Channels with Soft Outputs 

In the case of the classic AWGN-1 channel, the capacity formulas imply a well-known threshold on 
the minimum required signal-to-noise ratio (SNR) per information bit communicated over the channel. 
If the AWGN-1 channel SNR is E a /Ng (per channel symbol), the corresponding bit SNR is computed as 
Eb/N q = ( E s /N 0 )/R (per information bit), where R (information bits/channel symbol) is the rate of the 
overall code applied to the channel. If the rate is at the capacity limit, then R = C, and the formula for 
the minimum possible bit SNR is ( Eb/Ng) m i n = ( E s /No)/C . For the AWGN-1 channel, it is well known 
that reliable communication is possible with appropriate coding if and only if (Eb/Ng) m i n does not drop 
below a brick-wall threshold of —1.59 dB. Higher thresholds apply if the code rate is constrained away 
from zero or if there are constraints on the modulation format. 

In this section, we find numerically that all of the channels considered in the previous section have 
quasi-brick- wall bit-SNR thresholds analogous to the well-known thresholds for the classic AWGN-1 
channel. To unify the treatment of these channels, we define a minimum bit-SNR parameter pb = p/ (2 C) 
or pb = po/(2C) for each channel. Note that this definition for the AWGN-1 channel reduces to pb = 
(E s /N 0 )/C = ( Eb/N 0 )mim so graphs of capacity versus pb have exactly the same interpretation in this 
case as classic AWGN-channel graphs of capacity versus ( E b /No) m in ■ 

A. The AWGN-1 Channel 

Exact computation of C for large dimensions is extremely complex, and it is necessary to resort to 
Monte Carlo methods as described in [5, Appendix I]. Figure 4 is an evaluation of PPM capacity on the 
AWGN-1 channel obtained by Monte Carlo simulation of Eq. (23) for M = 2,4,8,16,32,64,256. The 
figure shows AWGN-1 capacity as a function of the minimum required bit SNR pb = p/(2C) = Eb/Ng. 
We see from this figure that the restriction to an M- ary signal set causes the capacity curves for M-PPM 
to saturate at log 2 M bits, no matter how much SNR is supplied. On the other end of the scale, there 
is a characteristic minimum value of Eb/Ng for each M below which the capacity drops to zero. These 
minimum values result directly from restricting the M-dimensional signaling set to be the orthogonal set, 
and are discussed more in Section VI. When these capacity results are plotted on a (vertical) log scale 
(e.g., see Fig. 7 later), each capacity curve appears to approach a vertical brick wall at the minimum 
attainable value of E b /N 0 . 
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Fig. 4. PPM capacity on the AWGN-1 channel, determined by 
Monte Carlo simulation. 

Figure 5 gives a subset of the same capacity curves in Fig. 4, except that in this case the curves for 
M = 2, 4 were computed exactly from Eq. (23) using Mathematica. Figure 5 also overlays several related 
performance curves for comparison. First, in the case of 256-PPM, the analytic Approximation (24) is 
compared with the more exact evaluation of Eq. (23) using Monte Carlo simulation. Next, the figure 
superimposes a set of points (artificially connected by a curve) showing the actual signaling rate (not 
capacity) in bits per channel use versus the required Eb/Ng for uncoded M-PPM to achieve a bit-error 
probability (BER) of Pb = 10 -6 , where Pb = (1/2)(M/[M — 1])Pm, and the probability of uncoded 
symbol error, Pm, is given later in Eq. (48). The horizontal displacements between this curve and the 
family of capacity curves show how much room there is for improvement over an uncoded system at 
this BER. Finally, the figure shows (as dashed curves) the capacity formulas C = (fe/2)log 2 (l + p) = 
(k/2) log 2 (l + 2REb/No) for fc-dimensional Gaussian channels unrestricted as to modulation type, which 
can be explicitly solved at the capacity limit R = C for ( Eb/N 0 ) m i n = (2 2C ! k — 1)/2C. 

B. The AWGN-2 Channel 

Figure 6 shows the capacity of 256-PPM obtained by Monte Carlo integration of Eq. (31) for the 
AWGN-2 model, plotted for different values of p+ versus the bit-normalized SNR parameter defined by 
Pb = Po/(2C). Plotted (on a log scale) versus pb, the AWGN-2 capacity curves for different values of 
p + all saturate at 8 bits per channel use (since this is for 256-PPM), and they all appear to approach 
the same brick-wall threshold of —1.57 dB on the minimum attainable value of pb for 256-PPM, at least 
down to capacity levels below about 1 bit per channel use. At low values of p + (e.g., p + = 5 in the figure, 
which corresponds to a physical parameter r]fi s = 10.8 with an APD excess noise factor of F = 2.16), the 
capacity curves appear to approach the same asymptotes but the curvature in between is less pronounced 
and the capacity for fixed pb is less than for the pure AWGN-1 model. When p + is made really small, 
typically corresponding to an average of less than one absorbed photon per slot, we have noticed that the 
capacity curves do not stop at the brick-wall limit of —1.57 dB, i.e., there is a very small nonzero capacity 
for pb below —1.57 dB. This effect probably also shows up for any finite p + , but at capacity levels too 
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Fig. 5. PPM capacity on the AWGN-1 channel, compared with 
other performance curves. 
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Fig. 6. 256-PPM capacity on the AWGN-2 channel, deter- 
mined by Monte Carlo simulation. The case p + = °c is equiv- 
alent to the AWGN-1 channel. 
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low to be simulated accurately (much less than 1 bit per channel use). This breaching of the brick- wall 
threshold for very low values of capacity per channel use needs to be studied in more depth whenever we 
extend our results to the Poisson limit (corresponding to small p. j_ as compared with po). 

C. The Webb Channels 

We evaluated the M-dimensional expectations in Eq. (39) accurately via Monte Carlo simulation. Some 
results are plotted in Fig. 7 for the Webb-2 channels for different PPM orders M. Corresponding curves 
for the AWGN-1 channel are shown for comparison. The abscissa in this figure is the normalized bit SNR, 
Pb = p/{ 2C) for AWGN-1 and pb = p$/(2C) for Webb-2. Along each Webb-2 curve, the two independent 
variables held constant are A = 60.8 and p + = 17.6, which correspond to a representative optical APD 
problem with r]fi s = 38 detected signal photons per PPM word and an excess noise factor F = 2.16. 
The x-axis in this figure also shows “simplex-to-orthogonal” penalty factors, which are discussed later in 
Section VI. 

We see from this figure that the capacity curves for the more complicated Webb-2 model again resemble 
those for the well-known AWGN-1 model, as they bend around from horizontal asymptotes at log 2 M 
bits per channel use to an apparent vertical asymptote at pb = 101og 10 [(Mln2)/(M — 1)] dB. Again, as 
with the AWGN-2 capacity curves, there is evidence that the vertical asymptote is breached at very low 
values of capacity per channel use, but this is not noticeable in simulations except when p + is also very 
small. 

Figure 8 gives the Webb-2 capacity curves for the same set of physical parameters considered in Fig. 7, 
except that in this case the capacity is plotted versus the average background count, fib, instead of versus 
Pb . In each case, the capacity curves start at their asymptotic limit, log 2 M for small values of fib and 
then plummet toward zero as hf, is increased above n s . 

Figure 9 plots the capacity of 256-PPM on a Webb-2 channel for various values of fi s = Fp + /r]. Again 
the abscissa is the bit-normalized SNR parameter pb = pq/{2C). Note that the Webb-2 capacity curves 


-1.59 dB 



BIT SNR P £> , dB 


Fig. 7. Capacity of AWGN-1 and Webb-2 channels for different 
PPM sizes. 
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Fig. 8. Capacity of M-PPM on a Webb-2 channel. 



BIT SNRp b ,dB 

Fig. 9. Capacity of 256-PPM for AWGN-2 and Webb-2 channels. 
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in Fig. 9 approach the capacity curve of the AWGN-2 channel (shown for reference) more closely as n s , 
or, equivalently, gets large. 


V. Capacity of M-ary PPM on Channels with Hard Outputs 

For the AWGN-2 and Webb-2 channels, we also computed the capacity with hard-decision channel 
outputs. When a hard-decision detection scheme is used, the decoder operates on PPM symbol decisions 
from the demodulator, not individual soft counts. The modulator output is the PPM symbol having 
the maximum slot count [10]. This hard-decision channel is an M- ary input, M- ary output, symmetric 
channel with capacity given by [3] 


C = log 2 M + (1 - P M ) log 2 (l - P M ) + Pm log 2 ( 46 ) 

bits per channel use, where Pm is the probability of incorrect symbol detection. 

The probability of incorrect symbol detection is calculated as Pm = 1 — Pr[y ; - < yi,Mj ^ l|xi] = 
Pr[r)j < V \ , V) ^ l|xi] for the various definitions of {yj} and {vj} used in determining the capacities of 
the AWGN and Webb channels. Given the conditional independence of the observations in the M — 1 
nonsignal slots, this reduces to 


Pm = 1--E Ui | Xi PoCci) M-1 (47) 

where Po(u) is the cumulative probability distribution function for the normalized channel symbols {vj} in 
the nonsignal slots. Using the conditional statistics of v\ given x x in Eqs. (22), (28), (34), (40), and (45), 
we can find the probability of uncoded M-PPM symbol error, Pm, for the various channels considered in 
Section III. 

For the AWGN-1 channel, Pm is given by 


Pm = 1 — 



VP) $(i>)" _ 1 di> 


(48) 


where </>( x) — {1/ \i2i:)e~ x / 2 is the standard normalized Gaussian probability density function and 
<I>(:r) = /_ 4>{u)du is the corresponding normalized Gaussian probability distribution function. For the 

AWGN-2 channel, we have 


Pm = 1 - r J- 8 ?— ^ ^ Vto)) m M ~ l dv (49) 

J-oo \ Po + P+ V V Po + P+ ) 

Note that, when p + — »■ oo and po = p, the AWGN-2 channel becomes the AWGN-1 channel, and 
Eq. (49) reduces to Eq. (48). 

For the Webb channels, first define 4>(a;; S' 2 ) = f* g S 2 )dw to be the cumulative probability distri- 
bution corresponding to the normalized Webb probability density function defined in Eq. (5). Then Pm 
for the Webb-1 channel is computed as 


Pm = 1 - / <j> (w - VP! S 2 ) $(«;; 5 2 ) M ~ l dw 

J — OO 


(50) 
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Similarly, for the Webb-2 channel, 


Pm 



P+ 

Po + P+ 


(V^T^ {w ~ ^ 5 S ^ M 1 


(51) 


Finally, for the Webb+Gaussian channel, 


Pm = 1 — 


P+ 


Po + P+ 


P+ 


Po + P+ 


(x - y/po) ■,S l ,/3 1 <F(x;<5 0 ,/3o) dx 


(52) 


where $(x; <5 2 , /?) = f\ 6 2 , (3)dy is the cumulative probability distribution corresponding to the 

standardized Webb+Gaussian probability density function defined in Eq. (6). 

The soft-decision capacities obtained in Section III can be compared with the hard-decision capacities 
obtained by plugging the various formulas for Pm into the general hard-decision capacity expression 
of Eq. (46). Figure 10 compares capacities for the hard-output and soft-output AWGN-2 channels for 
the case of M = 256. A similar comparison of capacities is shown in Fig. 11 for the hard-output and 
soft-output Webb-2 channels. The hard-output Webb-2 capacity was computed in [6]. 

The capacity curves for both the AWGN-2 and the Webb-2 channels show that a minimum value of 
Pb is reached at a nonzero code rate. Unlike the soft-output channels, which exhibit monotonically better 
efficiency in terms of the bit-normalized SNR parameter pb as the code rate (and hence the capacity 
per channel use) is reduced toward zero, the bit-normalized SNR efficiency of the hard-output channel 



Fig. 10. Capacity of 256-PPM on the hard- and soft-output 
AWGN-2 channels. 
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Fig. 11. Capacity of 256-PPM on the hard- and soft-output 
Webb-2 channels. 


worsens if the capacity is lowered below about 4 bits per channel use. This implies that an optimum code 
rate of about 1/2 will achieve the lowest pb for the hard-output channel, while the soft-output channel 
achieves lowest pb in the limit as the code rate goes to 0. 

Another comparison of capacity for the hard-output and soft-output Webb models is shown in Fig. 12, 
this time plotted versus fib. The hard-output capacity in this figure was computed in [6] for the general 
Webb+ Gaussian channel that also models the effects of thermal noise. 

VI. Capacity Results for Simplex Signaling 

In this section, we demonstrate a similarity between AWGN and Webb channels in the fundamental 
brick-wall limits for Ai-ary orthogonal signaling and simplex signaling. 

A. AWGN Channels 

Among all Af-ary signaling schemes on the AWGN-1 channel, the simplex set maximizes the capacity 
C at low Eb/No [7]. The simplex set achieves the same capacity as an M - ary orthogonal signaling scheme 
that uses a factor of M/(M — 1) more energy per bit. Applying the Shannon limit of Eb/No > In 2 ss 
— 1.59 dB, it follows that any M-ary orthogonal signaling scheme requires Eb/N 0 > (M In2)/(M — 1) for 
reliable communication to occur. Thus, each M-ary orthogonal signal has an associated Eb/No penalty of 
10 log 10 M/(M — 1) dB. This is illustrated in Fig. 13(a), where each M-ary orthogonal signaling capacity 
curve has a vertical asymptote at Eb/No > (Mln2)/(Af — 1). When translated to simplex signal sets 
by shifting left by 10 log 10 M/(M — 1) dB, the vertical asymptotes for each of the curves coincide at 
Eb/No = —1.59 dB, as shown in Fig. 13(b). 

B. Webb Channels 

On the Webb-2 channel, we observe the same behavior. In Fig. 14(a), capacity curves for Af-ary 
orthogonal signaling on the Webb-2 channel have varying vertical asymptotes. When translated by 
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AVERAGE BACKGROUND COUNT n b 

Fig. 12. Comparison of capacity for the hard- and soft-output 
Webb models. 




BIT SNR E b /N b , dB BIT SNR E b /N 0 , dB 

Fig. 13. AWGN-1 capacity versus bit SNR for (a) M - ary orthogonal signaling and (b) M - ary simplex signaling. 


101og 10 M/(M — 1) dB, as in Fig. 14(b), these vertical asymptotes coincide at —1.59 dB in the same 
way they did on the AWGN-1 channel. Although simplex signaling on the Webb-2 channel may not be 
practical because it would necessitate coherent detection instead of simple photon detection, this finding 
illustrates an important similarity between the brick-wall limits on the AWGN and Webb channels. In 
both cases, the restriction to a finite orthogonal signal set imposes the same fundamental penalty on the 
minimum achievable bit SNR. 
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BIT SNR P £>, dB BIT SNR p b , dB 

Fig. 14. Webb-2 capacity versus bit SNR for (a) M - ary orthogonal signaling and (b) M - ary simplex signaling. 


VII. Comparison of Capacity Results with Actual Code Performance 

Figure 15 shows the performance of Reed Solomon (RS) codes on GF(2 fc ) applied to 2 fe -PPM for 
Pb = 10~ 6 . For each curve, the alphabet size is fixed and, therefore, the Reed-Solomon codeword size is 
fixed at ( 2 k — 1) fc-bit symbols. The curves are obtained by varying the code rate within each RS code 
family. For these curves, the RS decoder is assumed to correct only errors (i.e. , no erasures), and the 
uncoded symbol-error probability is given by Eq. (48) with M = 2 k . As an example, the performance of 
the (255,223) RS code, with code rate approximately 7/8, is plotted at approximately 7 bits per channel 
use on the 8-bit RS curve in Fig. 15. This code requires 2.6 dB for Pb = HR 6 and is only 1.8 dB worse 
than the capacity limit achievable by arbitrary codes of the same rate for 256-PPM (see the gap marked 
with “B” in Fig. 15). The additional gap, marked as “A,” is due to constraining the 256-dimensional 
signal set to be orthogonal. Note that the comparison between the performance of RS codes and the 
two capacity limits does not account for the fact that the RS decoder uses hard quantized inputs while 
both capacity limits are computed for unconstrained channel output. This limitation must account for 
some portion of the nonoptimality of RS codes. Another interesting observation from Fig. 15 is that RS 
codes appear to be optimum approximately at rate 3/4 for all alphabet sizes. Lower-rate RS codes have 
progressively worse performance. However, Fig. 15 suggests the possibility of finding codes of rate 1/2 
or lower that can achieve Pb = ICR 6 at Eb/No < 0 dB. This would be a gain of 2.6 dB or more over the 
(255,223) RS code with 256-PPM. Thus, it is natural to look for codes of rate 1/2 and lower to attempt 
to approach capacity more closely than RS codes. 

Some results are available on simple binary turbo codes of rate 1/2 and 1/3 as compared with RS 
codes of the same rate. 4 (See also [4].) The performance of these codes is estimated in Fig. 15 by using 
the following approximation. The required input symbol-error probabilities to achieve Pb = ICR 6 with 
RS codes or turbo codes are related to the Eb/N 0 required for uncoded 256-PPM to obtain those symbol- 
error probabilities. The difference in required Eb/No between RS and turbo codes is used to plot the 
turbo-code performance relative to the RS performance. This is an approximate comparison because the 
turbo codes were not simulated exactly on a PPM system with Gaussian noise. These results indicate 
that, while these binary turbo codes do outperform RS codes of the same rate, there still remains a gap 
of several dB to the capacity limit. 


4 Data Compression and Channel Coding, X2000 Report (internal document), Jet Propulsion Laboratory, Pasadena, Cali- 
fornia, September 15, 1997. 


24 



VIII. Conclusions and Future Work 

This article has analyzed several idealized channel models that can be used to approximate PPM 
signaling over an APD-detected optical communication channel. We were able to define a suitable bit- 
normalized SNR parameter, pb, such that all of these channels with soft outputs yield quasi-brick-wall 
thresholds on the minimum acceptable value of pb above which reliable communication is theoretically 
possible and below which it is not possible except at extremely low channel rates. Furthermore, under all 
of these models with soft channel outputs, the bit-SNR thresholds for different values of M differ from 
each other by the same “simplex-to-orthogonal penalty,” (M — 1 )/M. Under both the AWGN-2 and 
Webb-2 models, the gap between the capacities of hard- and soft-output channels is about 3 dB at the 
code rate giving the optimum hard-output bit SNR. On the hard-output channels, there is an optimum 
M beyond which capacity is diminished [6] because much of the small incremental information available 
from each slot is destroyed when all of that information must be summarized as a single decision among 
an increasing number of candidate slots. This contrasts sharply with the results for soft decisions, for 
which larger M gives uniformly better capacity under each model. 

The capacities computed in this article assume that there are no time gaps between successive channel 
uses. However, laser transmitters require a sufficient recharging time between pulses, which is guaranteed 
by introducing a dead time, T d , between successive channel uses, as shown in Fig. 16. This reduces the 
capacity computed above by a factor ( T w + T d )/T w , where T w = MT S is the PPM word duration. Since 
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this factor can be large, it is important to optimize the trade-off between achievable peak power and 
dead time. A more efficient and more information-theoretically sound method to guarantee a minimum 
spacing, T d , between pulses is to design a code that inherently satisfies such a constraint. Codes of this 
kind, usually called (d, k) codes, have been used extensively in the magnetic recording field. In these 
codes, at least d Os must follow every 1, and not more than k Os can follow a 1. Here a 1 corresponds to a 
pulse and a 0 to no pulse. For laser transmission, the k constraint is not applicable, but the d constraint 
directly reflects the need for dead time. In a future article, we will investigate how (d, k) codes can be 
designed to maximize the amount of information transmitted subject to this type of constraint. 


ONE CHANNEL USE 

◄ ► 


PPM 

SLOT 




T d T w = MT S PPM WORD 

Fig. 16. PPM signaling (M= 4, k= 2). 


References 


[1] S. Butman and M. J. Klass, “Capacity of Noncoherent Channels,” The Deep 
Space Network Progress Report, September and October 1973, JPL Technical 
Report 32-1526, vol. XVIII, Jet Propulsion Laboratory, Pasadena, California 
pp. 85-93, December 15, 1973. 

[2] F. M. Davidson and X. Sun, “Gaussian Approximation Versus Nearly Exact 
Performance Analysis of Optical Communication Systems with PPM Signaling 
and APD Receivers,” IEEE Trans. Commun., vol. 36, no. 11, pp. 1185-1192, 
November 1988. 

[3] R. G. Gallager, Information Theory and Reliable Communications, New York: 
Wiley, 1968. 

[4] J. Hamkins and M. Srinivasan, “Turbo Codes for APD-Detected PPM,” Allerton 
Conference, Monticello, Illinois, October 1998. 

[5] S. J. MacMullan and O. M. Collins, “The Capacity of Orthogonal and Bi- 
Orthogonal Codes on the Gaussian Channel,” IEEE Trans, on Information The- 
ory, vol. 44, no. 3, May 1998. 

[6] J. Hamkins, “The Capacity of Avalanche Photodiode-Detected Pulse-Position 
Modulation,” The Telecommunications and Mission Operations Progress Re- 
port 43-138, April-June 1999, Jet Propulsion Laboratory, Pasadena, California, 
pp. 1-19, August 15, 1999. 

http://tmo.jpl.nasa.gov/tmo/progress_report/42-138/138A.pdf 

[7] J. L. Massey, “All Signal Sets Centered About the Origin Are Optimal at Low 
Energy-to-Noise Ratios on the AWGN channel,” Proc. Int. Symp. Information 
Theory , Ronneby, Sweden, pp. 80-81, June 1976. 


26 


[8] K. R. Baker, “On the WMC Density as an Inverse Gaussian Probability Density,” 
IEEE Trans. Commun., vol. 44, no. 1, pp. 15-17, January 1996. 

[9] J. T. K. Tang and K. B. Letaief, “The Use of WMC Distribution for Performance 
Evaluation of APD Optical Communication Systems,” IEEE Trans Commun., 
vol. 46, no. 2, pp. 279-285, February 1998. 

[10] V. Vilnrotter, M. Simon, and M. Srinivasan, “Maximum Likelihood Detection of 
PPM Signals Governed by an Arbitrary Point Process Plus Additive Gaussian 
Noise,” JPL Publication 98-7, Jet Propulsion Laboratory, Pasadena, California, 
April 1998. 

[11] P. P. Webb, R. J. McIntyre, and J. Conradi, “Properties of Avalanche Photodi- 
odes,” RCA Review, vol. 35, pp. 234-278, June 1974. 


27 



Appendix A 

Parameters and Notation 


The following is a description of parameters and notation in this article, along with nominal values 
used in the numerical results. 


Laser and modulator parameters 

V 

1064 

Optical frequency, nm 

M 

2-256 

PPM order 

T a 

3.125 x 10“® 

Width of the PPM slot required by the laser, s 

T d 

4.32 x 10 -4 

Dead time between PPM symbols required by the laser, s 

OCer 

10 6 

Modulation extinction ratio 

APD detector parameters 

V 

38 percent 

Quantum efficiency 

k e ff 

0.007 

Ionization ratio 

G 

50-200 

Gain 

F 

2. 2-3.4 

Excess noise factor, F = h e ffG + (2 — 1/G)(1 — k e ff) 

T 

300 

Noise temperature, K 

Rl 

179,700 

Load resistance (transimpedance model), 5.75 x 10 12 X T s , 

B 

1/2T S 

Noise equivalent one-sided bandwidth, Hz 

h 

4 x 10 -14 

Bulk leakage current, A 

Is 

2 x 10“ 9 

Surface leakage current, A 

n 


Mean photons incident on the photodetector, per pulse (= or n s ) 

n b 

0.001-10,000 

Mean background photons incident on the photodetector, per slot 

n s 

100 

Mean signal photons incident on the photodetector, per pulse 

Physical constants 

e_ 

1.38 x 10“ 23 

Electron charge, C 

h 

6.7 x 10“ 34 

Planck’s constant, J/Hz 

K 

1.6 x 10“ 19 

Boltzmann’s constant, J/K 

Mathematical notation 

C 


Capacity, bits per channel use 

m, mo, mi 


Mean of Gaussian or Webb distribution 



Variance of Gaussian or Webb distribution 

5 2 ,S 2 ,S 2 


Skewness parameter of Webb distribution 

0,00,01 


Blending fraction for Webb+ Gaussian distribution 

p 


Equals m 2 /cr 2 (AWGN-1 or Webb-1 channel) 

PO 


Equals (mi — mo) 2 /^ (AWGN-2 or Webb-2 channel) 

PO 


Equals (mi — ?tIq)/(ctq + cr' 2 ) (Webb+Gaussian channel) 
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Mathematical notation (cont’d) 


Pb 

The minimum bit SNR at the capacity limit (equals {E^/Nq) on 

Pb 

f p/(2C) AWGN-1 or Webb-1 channel 
qUa S \ Po/(2C) AWGN-2 or Webb-2 or Webb+Gaussian channel 

p+ 

Equals (mi — mo) 2 /(crf — <Jq) 

A 

Equals Sf — Sq 

q 

Number of electrons generated at the output of the APD 

Pm 

Probability of uncoded M - PPM symbol error 

<t>(x) 

Gaussian probability density function, l/(\/27r)e — x / 2 

Q(x) 

Gaussian cumulative distribution function, f X <f>(y)dy 

<t>(w\8 2 ) 

Webb probability density function, see Eq. (5) 

$(u>; <5 2 ) 

Webb cumulative distribution function, 0(u; S 2 )du 

0(x; 8 2 , 0) 

Webb+Gaussian probability density function, see Eq. (6) 

<S>(x;5 2 ,/3) 

Webb+Gaussian cumulative distribution function, f X Qo 4>{y, $ 2 , (3)dy 
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Appendix B 

Moments of the Standardized Webb Probability Distribution 


Let W be distributed according to the standardized Webb density 


,, „ 2 \ 1 ( i w \~ 3 / 2 


i/2tT 


— w 




w > —8 


Let X = 1 + W/8. Then the probability density function of X is related to the inverse Gaussian density 
[8,9] by 


fx(x) = = x 3/2 exp 

V 


5 2 {x — 1 ) 
2x 


21 


= f(x\ M 2 ) 


It follows that the moment generator function for W is given by 


/ OO 

e sw p(w)dv 


/»oo 

/ e sS< - x ~ 1 ' 1 f(x; 1 , 5 2 )dx 

JO 


ss rsx -*/ 2 

e I — 7 ==- exp 




8 2 (x — l ) 2 
2x 


s8x 


dx 


-sS 


r°° 8x~ 3 / 2 


V2 n 


exp 


<5 2 [ 1 - ^ 
0 


o 


- 1 / 2 ' 


21 


2x 


exp 


8 2 1 - 




dx 


/(x;(l — 2s/<5) -1 / 2 ,<5 2 ) 


= exp 




Thus, the mean of W is M'( 0) = 0, the variance is M"( 0) = 1, and each moment of IT is a polynomial 
in the parameter 1 / <5 2 . 
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Appendix C 

Approximation to M-ary PPM AWGN Capacity for Large M 


To evaluate the capacity of the AWGN-2 channel for large M, we rewrite Eq. (31) as 


C = log 2 M - E v | X1 log 2 


M 


* + £■ 

i= 2 


(C-l) 


where Zj = [po/{po + p+)](u|/ 2) + [p + /(p 0 + p + )]^/poVj. When M is large, the sum of random variables 


J2jL 2 eZj converges to ( M — 1) times the mean of each e Zj , because {zj, j = 2, • • • , M} are independent 
and identically distributed, given x x . Thus, the capacity expression can be approximated by 


C « log 2 M - E Vl | Xl log 2 [l + e 21 (M - l)e z p 


(C-2) 


where e z i denotes the mean of e Zj . It remains to evaluate e z i . 

For any j > 1, Zj is a quadratic function of an 1V(0, 1) random variable Vj. To evaluate E v 
when v is N{ 0, 1), we write out the integral 

1 


0 av 2 /2+bv 


E 1} 


0 av 2 /2+bv 


J-c 


1 


V2tt J — oc 

i r°° 


V2n J-, 
1 


e -v 2 /^ e av 2 /2 e bv dv 

e ~(l-a)[v 2 /2-bv/(l-a)] dv 

e -([l-a]/2)(^-[h/(l-a)]) 2 e ([l-a\/ 2 )(b/[l-a}) 2 dv 


2ir 


V 1 - a 

1 


,(b 2 l2)l(l-a) if a<1 




(672)/(l-a) 


(C-3) 


Since Zj is of the above quadratic form with a = po/{po + P+) < 1 and b = yfpo P+/(Po + p+), we have 
e z i = \J ( po + p+)/p+ exp ([pop + /2]/[po + p+]). Thus, the approximate capacity expression is 


C « log 2 M - E V1 | X1 log 2 


= log 2 M - E Vl | Xl log 2 


1 + (M — 1)\/ P ° + P+ exp 
P+ 


PoP+ 


Po + P+ 


1 + ( M - 1 )J^±P± exp ( p0p+ ~ 2p+ ^ ~ PoV i 


P+ 


V 2 (po + P+) 


(C-4) 


where V\ given Xi is N(^/po 1 [p 0 + p+]/p+), as in Eq. (28). Equation (C-4) is valid for the AWGN-2 
channel and reduces to the AWGN-1 channel expression in Eq. (24) in the limit as p + — > oo, with p 0 
replaced by p. 
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